Outline: Applications of Neural Nets Nettalk -learning Pronunciation of English Text Classifying Sonar Targets 16.1 Nettalk 16.1.1 Overview Phoneme String Text Speech Figure 16.1: a Text-to-speech System Using Nettalk

نویسنده

  • T. J. Hazen
چکیده

NETtalk is a classic example of a back-propagation trained multi-layer perceptron network applied to a practical application. NETtalk, created by Sejnowski and Rosen-berg 1], applies a multi-layer network to the text-to-speech problem. The goal is to develop a system which can convert English text into its underlying sequence of phonemes and stress markers. The string of phonemes and stress markers can then be used by a speech synthesizer to generate an audio realization of the text as seen in Figure 16.1. In using a network approach to the problem, it was hoped that NETtalk could learn a general mapping of spelling to pronunciation. Other current text-to-speech products such as DECtalk, utilize a dictionary lookup for common and irregular English words and apply a set of phonological rules to convert words which don't appear in the NETtalk DECtalk

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Achieving High-Accuracy Text-to-Speech with Machine Learning

In 1987, Sejnowski and Rosenberg developed their famous NETtalk system for English text-to-speech. This chapter describes a machine learning approach to text-to-speech that builds upon and extends the initial NETtalk work. Among the many extensions to the NETtalk system were the following: a diierent learning algorithm, a wider input \window", error-correcting output coding, a right-to-left sca...

متن کامل

Parallel Networks that Learn to Pronounce English Text Terrence

This paper describes NETtalk, a class of massively-parallel network systems that learn to convert English text to speech. The memory representations for pronunciations are learned by practice and are shared among many processing units. The performance of NETtalk has some similarities with observed human performance. (i) The learning follows a power law. (;i) The more words the network learns, t...

متن کامل

A Comparison of the Classic NetTalk Text-to-Speech System to a Modern, Distributed Representation and Simple Recurrent Network

This paper reports on a comparison to the well-known NetTalk implementation of Engl!sh text-to-speech translation via neural networks. A distributed representation scheme for encoding is investigated opposed to the classic localist representation scheme used in the original NetTalk. The paper discusses a modem re-implementation based on Elman’s Simple Recurrent Network.

متن کامل

Nettalk: a Parallel Network That Learns to Read Aloud

Unrestricted English text can be converted to speech by applying phonological rules and handling exceptions with a look-up table. However, this approach is highly labor intensive since each entry and rule must be hand-crafted. NETtalk is an alternative approach that is based on an automated learning procedure for a parallel network of deterministic processing units. ~ f t e r ' training on a co...

متن کامل

Parallel Networks that Learn to Pronounce English Text

This paper describes NETtalk, a class of massively-parallel network systems that learn to convert English text to speech. The memory representations for pronunciations are learned by practice and are shared among many processing units. The performance of NETtalk has some similarities with observed human performance. (i) The learning follows a power law. (ii) The more words the network learns, t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994